Jorge Luis Alcoser
Flores, Universidad Cesar Vallejo (Peru), jl.tauro.88@gmail.com Primary
Fredy Hernan Gomez Lopez, Franciso Jose de Caldas Bogota - Colombia, dygomez@gmail.com
Miguel Francisco Jarma Forero, Universidad del Magdalena - Colombia, mjarma83@gmail.com
Student Team: YES
Tableau, Excel, Sql Server 2012,Spss,Qgis
Approximately how many hours were spent working on this submission
in total?
Our team spend 56 hours working on the
challenge
May we post your submission in the Visual Analytics Benchmark
Repository after VAST Challenge 2014 is complete? YES
Video:
Youtube: UBA-Alcoser-MC2_Video
Download: UBA-Alcoser-MC2_Video
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Questions
MC2.1 – Describe common daily
routines for GAStech employees. What does a day in
the life of a typical GAStech employee look
like? Please limit your response to no
more than five images and 300 words.
To find the daily
routines for the Gastech employees we make a correlation analysis and find the
next results
|
|
MC2.2 – Identify up to twelve unusual events or patterns that you see
in the data. If you identify more than twelve patterns during your analysis,
focus your answer on the patterns you consider to be most important for further
investigation to help find the missing staff members. For each pattern or event
you identify, describe
Evente
No.1
Evente
No.2
Evente
No.3
|
|
MC2.3 – Like
most datasets, the data you were provided is imperfect, with possible issues
such as missing data, conflicting data, data of varying resolutions, outliers,
or other kinds of confusing data. Considering MC2 data is
primarily spatiotemporal, describe how you
identified and addressed the uncertainties and conflicts inherent in this data
to reach your conclusions in questions MC2.1 and MC2.2. Please limit your response to no more than
five images and 300 words.
The problems we
encounter when analyzing the data were:
GPS data was reduced by taking only the data of
latitude and longitude, reducing it to the minimum length in minute and hour
specified. We unify the Transactions of LOYALTY_DATA and CC_DATA tables with
records that were identified as the same transaction and creating a variable
for whether loyalty card is presented when they made the transaction. Other
thing we do to reduce the data was a join between the tables cars_assigment and
credit_card transaction to see the car location of the different employees,
with their transactions, this was done by first name and last name, and hour of
the transactions.
To complement the missing values of
the varibles CURRENTEMPLOYMENTTYPE, CURRENTEMPLOYMENTTITLE we used employees
records file of the minichallenge1
|
After performing a deeper analysis of the data we found an outlier which gives us a step to think about doing other types of analysis |
We made a join between the transaction date and the GPS date, to have an
specific idea of a record in a given time, gps location, location, employee and
Price.